Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 547271 |
| Missing cells | 629315 |
| Missing cells (%) | 5.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 87.7 MiB |
| Average record size in memory | 168.0 B |
Variable types
| DateTime | 1 |
|---|---|
| Categorical | 4 |
| Numeric | 12 |
| Text | 4 |
ARR_DELAY is highly overall correlated with CANCELLED and 2 other fields | High correlation |
ARR_TIME is highly overall correlated with CANCELLED and 1 other fields | High correlation |
CANCELLATION_CODE is highly overall correlated with CANCELLED and 2 other fields | High correlation |
CANCELLED is highly overall correlated with ARR_DELAY and 2 other fields | High correlation |
DEP_DELAY is highly overall correlated with ARR_DELAY | High correlation |
DEP_TIME is highly overall correlated with ARR_TIME | High correlation |
DEST_AIRPORT_ID is highly overall correlated with DEST_AIRPORT_SEQ_ID and 1 other fields | High correlation |
DEST_AIRPORT_SEQ_ID is highly overall correlated with DEST_AIRPORT_ID and 1 other fields | High correlation |
DEST_CITY_MARKET_ID is highly overall correlated with DEST_AIRPORT_ID and 1 other fields | High correlation |
DIVERTED is highly overall correlated with ARR_DELAY and 1 other fields | High correlation |
OP_UNIQUE_CARRIER is highly overall correlated with CANCELLATION_CODE | High correlation |
ORIGIN_AIRPORT_ID is highly overall correlated with ORIGIN_AIRPORT_SEQ_ID and 1 other fields | High correlation |
ORIGIN_AIRPORT_SEQ_ID is highly overall correlated with ORIGIN_AIRPORT_ID and 1 other fields | High correlation |
ORIGIN_CITY_MARKET_ID is highly overall correlated with ORIGIN_AIRPORT_ID and 1 other fields | High correlation |
CANCELLED is highly imbalanced (77.0%) | Imbalance |
DIVERTED is highly imbalanced (97.3%) | Imbalance |
DEP_TIME has 19784 (3.6%) missing values | Missing |
DEP_DELAY has 19858 (3.6%) missing values | Missing |
TAXI_OUT has 20257 (3.7%) missing values | Missing |
ARR_TIME has 20633 (3.8%) missing values | Missing |
ARR_DELAY has 21901 (4.0%) missing values | Missing |
CANCELLATION_CODE has 526882 (96.3%) missing values | Missing |
DEP_DELAY has 23234 (4.2%) zeros | Zeros |
ARR_DELAY has 8971 (1.6%) zeros | Zeros |
Reproduction
| Analysis started | 2024-09-18 02:39:36.512949 |
|---|---|
| Analysis finished | 2024-09-18 02:40:18.212898 |
| Duration | 41.7 seconds |
| Software version | ydata-profiling vv4.10.0 |
| Download configuration | config.json |
FL_DATE
Date
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| Minimum | 2024-01-01 00:00:00 |
|---|---|
| Maximum | 2024-01-31 00:00:00 |
OP_UNIQUE_CARRIER
Categorical
HIGH CORRELATION 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| WN | |
|---|---|
| AA | |
| DL | |
| UA | |
| OO | |
| Other values (10) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1094542 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 9E |
|---|---|
| 2nd row | 9E |
| 3rd row | 9E |
| 4th row | 9E |
| 5th row | 9E |
Common Values
| Value | Count | Frequency (%) |
| WN | 115389 | |
| AA | 77346 | |
| DL | 74384 | |
| UA | 58855 | |
| OO | 56814 | |
| YX | 22914 | 4.2% |
| MQ | 20750 | 3.8% |
| NK | 20415 | 3.7% |
| B6 | 19580 | 3.6% |
| AS | 17775 | 3.2% |
| Other values (5) | 63049 |
Length
| Value | Count | Frequency (%) |
| wn | 115389 | |
| aa | 77346 | |
| dl | 74384 | |
| ua | 58855 | |
| oo | 56814 | |
| yx | 22914 | 4.2% |
| mq | 20750 | 3.8% |
| nk | 20415 | 3.7% |
| b6 | 19580 | 3.6% |
| as | 17775 | 3.2% |
| Other values (5) | 63049 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 237898 | |
| N | 135804 | |
| O | 130154 | |
| W | 115389 | |
| D | 74384 | 6.8% |
| L | 74384 | 6.8% |
| U | 58855 | 5.4% |
| 9 | 31351 | 2.9% |
| H | 23102 | 2.1% |
| Y | 22914 | 2.1% |
| Other values (11) | 190307 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1094542 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 237898 | |
| N | 135804 | |
| O | 130154 | |
| W | 115389 | |
| D | 74384 | 6.8% |
| L | 74384 | 6.8% |
| U | 58855 | 5.4% |
| 9 | 31351 | 2.9% |
| H | 23102 | 2.1% |
| Y | 22914 | 2.1% |
| Other values (11) | 190307 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1094542 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 237898 | |
| N | 135804 | |
| O | 130154 | |
| W | 115389 | |
| D | 74384 | 6.8% |
| L | 74384 | 6.8% |
| U | 58855 | 5.4% |
| 9 | 31351 | 2.9% |
| H | 23102 | 2.1% |
| Y | 22914 | 2.1% |
| Other values (11) | 190307 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1094542 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 237898 | |
| N | 135804 | |
| O | 130154 | |
| W | 115389 | |
| D | 74384 | 6.8% |
| L | 74384 | 6.8% |
| U | 58855 | 5.4% |
| 9 | 31351 | 2.9% |
| H | 23102 | 2.1% |
| Y | 22914 | 2.1% |
| Other values (11) | 190307 |
OP_CARRIER_FL_NUM
Real number (ℝ)
| Distinct | 5914 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2344.8843 |
| Minimum | 1 |
|---|---|
| Maximum | 8819 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 295 |
| Q1 | 1083 |
| median | 2069 |
| Q3 | 3454 |
| 95-th percentile | 5384 |
| Maximum | 8819 |
| Range | 8818 |
| Interquartile range (IQR) | 2371 |
Descriptive statistics
| Standard deviation | 1576.2543 |
|---|---|
| Coefficient of variation (CV) | 0.67220983 |
| Kurtosis | -0.6721993 |
| Mean | 2344.8843 |
| Median Absolute Deviation (MAD) | 1148 |
| Skewness | 0.56529359 |
| Sum | 1.2832872 × 109 |
| Variance | 2484577.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 698 | 295 | 0.1% |
| 1245 | 275 | 0.1% |
| 687 | 270 | < 0.1% |
| 555 | 266 | < 0.1% |
| 777 | 264 | < 0.1% |
| 321 | 256 | < 0.1% |
| 1279 | 255 | < 0.1% |
| 336 | 254 | < 0.1% |
| 311 | 253 | < 0.1% |
| 396 | 253 | < 0.1% |
| Other values (5904) | 544630 |
| Value | Count | Frequency (%) |
| 1 | 146 | |
| 2 | 122 | |
| 3 | 120 | |
| 4 | 124 | |
| 5 | 106 | |
| 6 | 94 | |
| 7 | 135 | |
| 8 | 98 | |
| 9 | 153 | |
| 10 | 144 |
| Value | Count | Frequency (%) |
| 8819 | 3 | |
| 8818 | 2 | < 0.1% |
| 8817 | 1 | < 0.1% |
| 8811 | 2 | < 0.1% |
| 8810 | 1 | < 0.1% |
| 8809 | 2 | < 0.1% |
| 8808 | 3 | |
| 8807 | 1 | < 0.1% |
| 8806 | 1 | < 0.1% |
| 8804 | 5 |
ORIGIN_AIRPORT_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 334 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12659.023 |
| Minimum | 10135 |
|---|---|
| Maximum | 16869 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 10135 |
|---|---|
| 5-th percentile | 10397 |
| Q1 | 11292 |
| median | 12889 |
| Q3 | 14027 |
| 95-th percentile | 14893 |
| Maximum | 16869 |
| Range | 6734 |
| Interquartile range (IQR) | 2735 |
Descriptive statistics
| Standard deviation | 1526.2764 |
|---|---|
| Coefficient of variation (CV) | 0.12056827 |
| Kurtosis | -1.2942228 |
| Mean | 12659.023 |
| Median Absolute Deviation (MAD) | 1591 |
| Skewness | 0.1064243 |
| Sum | 6.9279161 × 109 |
| Variance | 2329519.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10397 | 26315 | 4.8% |
| 11298 | 23570 | 4.3% |
| 11292 | 23361 | 4.3% |
| 13930 | 20321 | 3.7% |
| 11057 | 16378 | 3.0% |
| 14107 | 15378 | 2.8% |
| 12892 | 15228 | 2.8% |
| 12889 | 14942 | 2.7% |
| 13204 | 14296 | 2.6% |
| 12953 | 12700 | 2.3% |
| Other values (324) | 364782 |
| Value | Count | Frequency (%) |
| 10135 | 349 | 0.1% |
| 10136 | 151 | < 0.1% |
| 10140 | 1808 | |
| 10141 | 61 | < 0.1% |
| 10146 | 62 | < 0.1% |
| 10155 | 93 | < 0.1% |
| 10157 | 148 | < 0.1% |
| 10158 | 262 | < 0.1% |
| 10165 | 9 | < 0.1% |
| 10170 | 56 | < 0.1% |
| Value | Count | Frequency (%) |
| 16869 | 149 | < 0.1% |
| 16218 | 111 | < 0.1% |
| 15991 | 60 | < 0.1% |
| 15919 | 979 | |
| 15841 | 60 | < 0.1% |
| 15624 | 553 | |
| 15607 | 62 | < 0.1% |
| 15582 | 52 | < 0.1% |
| 15569 | 53 | < 0.1% |
| 15412 | 1117 |
ORIGIN_AIRPORT_SEQ_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 334 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1265906.2 |
| Minimum | 1013506 |
|---|---|
| Maximum | 1686902 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 1013506 |
|---|---|
| 5-th percentile | 1039707 |
| Q1 | 1129202 |
| median | 1288904 |
| Q3 | 1402702 |
| 95-th percentile | 1489302 |
| Maximum | 1686902 |
| Range | 673396 |
| Interquartile range (IQR) | 273500 |
Descriptive statistics
| Standard deviation | 152627.43 |
|---|---|
| Coefficient of variation (CV) | 0.12056772 |
| Kurtosis | -1.2942287 |
| Mean | 1265906.2 |
| Median Absolute Deviation (MAD) | 159098 |
| Skewness | 0.10642538 |
| Sum | 6.9279377 × 1011 |
| Variance | 2.3295134 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1039707 | 26315 | 4.8% |
| 1129806 | 23570 | 4.3% |
| 1129202 | 23361 | 4.3% |
| 1393008 | 20321 | 3.7% |
| 1105703 | 16378 | 3.0% |
| 1410702 | 15378 | 2.8% |
| 1289208 | 15228 | 2.8% |
| 1288904 | 14942 | 2.7% |
| 1320402 | 14296 | 2.6% |
| 1295304 | 12700 | 2.3% |
| Other values (324) | 364782 |
| Value | Count | Frequency (%) |
| 1013506 | 349 | 0.1% |
| 1013603 | 151 | < 0.1% |
| 1014005 | 1808 | |
| 1014106 | 61 | < 0.1% |
| 1014602 | 62 | < 0.1% |
| 1015502 | 93 | < 0.1% |
| 1015706 | 148 | < 0.1% |
| 1015804 | 262 | < 0.1% |
| 1016506 | 9 | < 0.1% |
| 1017004 | 56 | < 0.1% |
| Value | Count | Frequency (%) |
| 1686902 | 149 | < 0.1% |
| 1621802 | 111 | < 0.1% |
| 1599102 | 60 | < 0.1% |
| 1591905 | 979 | |
| 1584102 | 60 | < 0.1% |
| 1562404 | 553 | |
| 1560702 | 62 | < 0.1% |
| 1558203 | 52 | < 0.1% |
| 1556903 | 53 | < 0.1% |
| 1541206 | 1117 |
ORIGIN_CITY_MARKET_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 311 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31751.846 |
| Minimum | 30070 |
|---|---|
| Maximum | 35991 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 30070 |
|---|---|
| 5-th percentile | 30194 |
| Q1 | 30647 |
| median | 31454 |
| Q3 | 32467 |
| 95-th percentile | 34570 |
| Maximum | 35991 |
| Range | 5921 |
| Interquartile range (IQR) | 1820 |
Descriptive statistics
| Standard deviation | 1320.5635 |
|---|---|
| Coefficient of variation (CV) | 0.041590133 |
| Kurtosis | -0.2889393 |
| Mean | 31751.846 |
| Median Absolute Deviation (MAD) | 994 |
| Skewness | 0.80497411 |
| Sum | 1.7376865 × 1010 |
| Variance | 1743888 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31703 | 33930 | 6.2% |
| 30194 | 29607 | 5.4% |
| 30397 | 26315 | 4.8% |
| 30977 | 26207 | 4.8% |
| 32575 | 24575 | 4.5% |
| 30325 | 23361 | 4.3% |
| 30852 | 23025 | 4.2% |
| 32467 | 18674 | 3.4% |
| 32457 | 17503 | 3.2% |
| 31057 | 16378 | 3.0% |
| Other values (301) | 307696 |
| Value | Count | Frequency (%) |
| 30070 | 56 | < 0.1% |
| 30073 | 46 | < 0.1% |
| 30107 | 30 | < 0.1% |
| 30113 | 60 | < 0.1% |
| 30135 | 349 | 0.1% |
| 30136 | 151 | < 0.1% |
| 30140 | 1808 | |
| 30141 | 61 | < 0.1% |
| 30146 | 62 | < 0.1% |
| 30155 | 93 | < 0.1% |
| Value | Count | Frequency (%) |
| 35991 | 60 | < 0.1% |
| 35841 | 60 | < 0.1% |
| 35582 | 52 | < 0.1% |
| 35569 | 53 | < 0.1% |
| 35550 | 62 | < 0.1% |
| 35412 | 1117 | |
| 35411 | 93 | < 0.1% |
| 35401 | 69 | < 0.1% |
| 35389 | 62 | < 0.1% |
| 35380 | 211 | < 0.1% |
ORIGIN
Text
| Distinct | 334 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1641813 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | JFK |
|---|---|
| 2nd row | MSP |
| 3rd row | JFK |
| 4th row | RIC |
| 5th row | DTW |
| Value | Count | Frequency (%) |
| atl | 26315 | 4.8% |
| dfw | 23570 | 4.3% |
| den | 23361 | 4.3% |
| ord | 20321 | 3.7% |
| clt | 16378 | 3.0% |
| phx | 15378 | 2.8% |
| lax | 15228 | 2.8% |
| las | 14942 | 2.7% |
| mco | 14296 | 2.6% |
| lga | 12700 | 2.3% |
| Other values (324) | 364782 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 186539 | 11.4% |
| L | 153698 | 9.4% |
| S | 139156 | 8.5% |
| D | 127493 | 7.8% |
| T | 87421 | 5.3% |
| C | 84060 | 5.1% |
| O | 82470 | 5.0% |
| M | 74361 | 4.5% |
| F | 67709 | 4.1% |
| W | 63837 | 3.9% |
| Other values (16) | 575069 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 186539 | 11.4% |
| L | 153698 | 9.4% |
| S | 139156 | 8.5% |
| D | 127493 | 7.8% |
| T | 87421 | 5.3% |
| C | 84060 | 5.1% |
| O | 82470 | 5.0% |
| M | 74361 | 4.5% |
| F | 67709 | 4.1% |
| W | 63837 | 3.9% |
| Other values (16) | 575069 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 186539 | 11.4% |
| L | 153698 | 9.4% |
| S | 139156 | 8.5% |
| D | 127493 | 7.8% |
| T | 87421 | 5.3% |
| C | 84060 | 5.1% |
| O | 82470 | 5.0% |
| M | 74361 | 4.5% |
| F | 67709 | 4.1% |
| W | 63837 | 3.9% |
| Other values (16) | 575069 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 186539 | 11.4% |
| L | 153698 | 9.4% |
| S | 139156 | 8.5% |
| D | 127493 | 7.8% |
| T | 87421 | 5.3% |
| C | 84060 | 5.1% |
| O | 82470 | 5.0% |
| M | 74361 | 4.5% |
| F | 67709 | 4.1% |
| W | 63837 | 3.9% |
| Other values (16) | 575069 |
ORIGIN_CITY_NAME
Text
| Distinct | 328 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 13.097882 |
| Min length | 8 |
Characters and Unicode
| Total characters | 7168091 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New York, NY |
|---|---|
| 2nd row | Minneapolis, MN |
| 3rd row | New York, NY |
| 4th row | Richmond, VA |
| 5th row | Detroit, MI |
| Value | Count | Frequency (%) |
| tx | 58037 | 4.5% |
| ca | 57574 | 4.5% |
| fl | 56092 | 4.4% |
| ny | 28471 | 2.2% |
| ga | 28164 | 2.2% |
| san | 27793 | 2.2% |
| co | 27365 | 2.1% |
| il | 27323 | 2.1% |
| atlanta | 26315 | 2.1% |
| new | 26236 | 2.0% |
| Other values (398) | 917191 |
Most occurring characters
| Value | Count | Frequency (%) |
| 733290 | 10.2% | |
| a | 551247 | 7.7% |
| , | 547271 | 7.6% |
| o | 393065 | 5.5% |
| e | 378791 | 5.3% |
| n | 349775 | 4.9% |
| t | 341475 | 4.8% |
| l | 317301 | 4.4% |
| i | 271204 | 3.8% |
| r | 261354 | 3.6% |
| Other values (46) | 3023318 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7168091 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 733290 | 10.2% | |
| a | 551247 | 7.7% |
| , | 547271 | 7.6% |
| o | 393065 | 5.5% |
| e | 378791 | 5.3% |
| n | 349775 | 4.9% |
| t | 341475 | 4.8% |
| l | 317301 | 4.4% |
| i | 271204 | 3.8% |
| r | 261354 | 3.6% |
| Other values (46) | 3023318 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7168091 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 733290 | 10.2% | |
| a | 551247 | 7.7% |
| , | 547271 | 7.6% |
| o | 393065 | 5.5% |
| e | 378791 | 5.3% |
| n | 349775 | 4.9% |
| t | 341475 | 4.8% |
| l | 317301 | 4.4% |
| i | 271204 | 3.8% |
| r | 261354 | 3.6% |
| Other values (46) | 3023318 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7168091 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 733290 | 10.2% | |
| a | 551247 | 7.7% |
| , | 547271 | 7.6% |
| o | 393065 | 5.5% |
| e | 378791 | 5.3% |
| n | 349775 | 4.9% |
| t | 341475 | 4.8% |
| l | 317301 | 4.4% |
| i | 271204 | 3.8% |
| r | 261354 | 3.6% |
| Other values (46) | 3023318 |
DEST_AIRPORT_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 334 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12659.086 |
| Minimum | 10135 |
|---|---|
| Maximum | 16869 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 10135 |
|---|---|
| 5-th percentile | 10397 |
| Q1 | 11292 |
| median | 12889 |
| Q3 | 14027 |
| 95-th percentile | 14893 |
| Maximum | 16869 |
| Range | 6734 |
| Interquartile range (IQR) | 2735 |
Descriptive statistics
| Standard deviation | 1526.236 |
|---|---|
| Coefficient of variation (CV) | 0.12056447 |
| Kurtosis | -1.2941919 |
| Mean | 12659.086 |
| Median Absolute Deviation (MAD) | 1591 |
| Skewness | 0.10643995 |
| Sum | 6.9279507 × 109 |
| Variance | 2329396.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10397 | 26294 | 4.8% |
| 11298 | 23588 | 4.3% |
| 11292 | 23357 | 4.3% |
| 13930 | 20327 | 3.7% |
| 11057 | 16382 | 3.0% |
| 14107 | 15367 | 2.8% |
| 12892 | 15220 | 2.8% |
| 12889 | 14943 | 2.7% |
| 13204 | 14304 | 2.6% |
| 12953 | 12718 | 2.3% |
| Other values (324) | 364771 |
| Value | Count | Frequency (%) |
| 10135 | 349 | 0.1% |
| 10136 | 151 | < 0.1% |
| 10140 | 1807 | |
| 10141 | 61 | < 0.1% |
| 10146 | 62 | < 0.1% |
| 10155 | 93 | < 0.1% |
| 10157 | 149 | < 0.1% |
| 10158 | 262 | < 0.1% |
| 10165 | 9 | < 0.1% |
| 10170 | 57 | < 0.1% |
| Value | Count | Frequency (%) |
| 16869 | 149 | < 0.1% |
| 16218 | 111 | < 0.1% |
| 15991 | 60 | < 0.1% |
| 15919 | 977 | |
| 15841 | 60 | < 0.1% |
| 15624 | 553 | |
| 15607 | 62 | < 0.1% |
| 15582 | 52 | < 0.1% |
| 15569 | 53 | < 0.1% |
| 15412 | 1119 |
DEST_AIRPORT_SEQ_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 334 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1265912.6 |
| Minimum | 1013506 |
|---|---|
| Maximum | 1686902 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 1013506 |
|---|---|
| 5-th percentile | 1039707 |
| Q1 | 1129202 |
| median | 1288904 |
| Q3 | 1402702 |
| 95-th percentile | 1489302 |
| Maximum | 1686902 |
| Range | 673396 |
| Interquartile range (IQR) | 273500 |
Descriptive statistics
| Standard deviation | 152623.39 |
|---|---|
| Coefficient of variation (CV) | 0.12056393 |
| Kurtosis | -1.2941977 |
| Mean | 1265912.6 |
| Median Absolute Deviation (MAD) | 159098 |
| Skewness | 0.10644102 |
| Sum | 6.9279723 × 1011 |
| Variance | 2.3293899 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1039707 | 26294 | 4.8% |
| 1129806 | 23588 | 4.3% |
| 1129202 | 23357 | 4.3% |
| 1393008 | 20327 | 3.7% |
| 1105703 | 16382 | 3.0% |
| 1410702 | 15367 | 2.8% |
| 1289208 | 15220 | 2.8% |
| 1288904 | 14943 | 2.7% |
| 1320402 | 14304 | 2.6% |
| 1295304 | 12718 | 2.3% |
| Other values (324) | 364771 |
| Value | Count | Frequency (%) |
| 1013506 | 349 | 0.1% |
| 1013603 | 151 | < 0.1% |
| 1014005 | 1807 | |
| 1014106 | 61 | < 0.1% |
| 1014602 | 62 | < 0.1% |
| 1015502 | 93 | < 0.1% |
| 1015706 | 149 | < 0.1% |
| 1015804 | 262 | < 0.1% |
| 1016506 | 9 | < 0.1% |
| 1017004 | 57 | < 0.1% |
| Value | Count | Frequency (%) |
| 1686902 | 149 | < 0.1% |
| 1621802 | 111 | < 0.1% |
| 1599102 | 60 | < 0.1% |
| 1591905 | 977 | |
| 1584102 | 60 | < 0.1% |
| 1562404 | 553 | |
| 1560702 | 62 | < 0.1% |
| 1558203 | 52 | < 0.1% |
| 1556903 | 53 | < 0.1% |
| 1541206 | 1119 |
DEST_CITY_MARKET_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 311 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31751.864 |
| Minimum | 30070 |
|---|---|
| Maximum | 35991 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 30070 |
|---|---|
| 5-th percentile | 30194 |
| Q1 | 30647 |
| median | 31454 |
| Q3 | 32467 |
| 95-th percentile | 34570 |
| Maximum | 35991 |
| Range | 5921 |
| Interquartile range (IQR) | 1820 |
Descriptive statistics
| Standard deviation | 1320.5728 |
|---|---|
| Coefficient of variation (CV) | 0.041590402 |
| Kurtosis | -0.28881987 |
| Mean | 31751.864 |
| Median Absolute Deviation (MAD) | 994 |
| Skewness | 0.80502134 |
| Sum | 1.7376874 × 1010 |
| Variance | 1743912.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31703 | 33949 | 6.2% |
| 30194 | 29625 | 5.4% |
| 30397 | 26294 | 4.8% |
| 30977 | 26209 | 4.8% |
| 32575 | 24564 | 4.5% |
| 30325 | 23357 | 4.3% |
| 30852 | 23027 | 4.2% |
| 32467 | 18657 | 3.4% |
| 32457 | 17501 | 3.2% |
| 31057 | 16382 | 3.0% |
| Other values (301) | 307706 |
| Value | Count | Frequency (%) |
| 30070 | 57 | < 0.1% |
| 30073 | 46 | < 0.1% |
| 30107 | 30 | < 0.1% |
| 30113 | 60 | < 0.1% |
| 30135 | 349 | 0.1% |
| 30136 | 151 | < 0.1% |
| 30140 | 1807 | |
| 30141 | 61 | < 0.1% |
| 30146 | 62 | < 0.1% |
| 30155 | 93 | < 0.1% |
| Value | Count | Frequency (%) |
| 35991 | 60 | < 0.1% |
| 35841 | 60 | < 0.1% |
| 35582 | 52 | < 0.1% |
| 35569 | 53 | < 0.1% |
| 35550 | 62 | < 0.1% |
| 35412 | 1119 | |
| 35411 | 93 | < 0.1% |
| 35401 | 69 | < 0.1% |
| 35389 | 62 | < 0.1% |
| 35380 | 211 | < 0.1% |
DEST
Text
| Distinct | 334 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1641813 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DTW |
|---|---|
| 2nd row | CLE |
| 3rd row | RIC |
| 4th row | JFK |
| 5th row | MKE |
| Value | Count | Frequency (%) |
| atl | 26294 | 4.8% |
| dfw | 23588 | 4.3% |
| den | 23357 | 4.3% |
| ord | 20327 | 3.7% |
| clt | 16382 | 3.0% |
| phx | 15367 | 2.8% |
| lax | 15220 | 2.8% |
| las | 14943 | 2.7% |
| mco | 14304 | 2.6% |
| lga | 12718 | 2.3% |
| Other values (324) | 364771 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 186546 | 11.4% |
| L | 153656 | 9.4% |
| S | 139145 | 8.5% |
| D | 127527 | 7.8% |
| T | 87412 | 5.3% |
| C | 84075 | 5.1% |
| O | 82470 | 5.0% |
| M | 74356 | 4.5% |
| F | 67709 | 4.1% |
| W | 63854 | 3.9% |
| Other values (16) | 575063 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 186546 | 11.4% |
| L | 153656 | 9.4% |
| S | 139145 | 8.5% |
| D | 127527 | 7.8% |
| T | 87412 | 5.3% |
| C | 84075 | 5.1% |
| O | 82470 | 5.0% |
| M | 74356 | 4.5% |
| F | 67709 | 4.1% |
| W | 63854 | 3.9% |
| Other values (16) | 575063 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 186546 | 11.4% |
| L | 153656 | 9.4% |
| S | 139145 | 8.5% |
| D | 127527 | 7.8% |
| T | 87412 | 5.3% |
| C | 84075 | 5.1% |
| O | 82470 | 5.0% |
| M | 74356 | 4.5% |
| F | 67709 | 4.1% |
| W | 63854 | 3.9% |
| Other values (16) | 575063 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 186546 | 11.4% |
| L | 153656 | 9.4% |
| S | 139145 | 8.5% |
| D | 127527 | 7.8% |
| T | 87412 | 5.3% |
| C | 84075 | 5.1% |
| O | 82470 | 5.0% |
| M | 74356 | 4.5% |
| F | 67709 | 4.1% |
| W | 63854 | 3.9% |
| Other values (16) | 575063 |
DEST_CITY_NAME
Text
| Distinct | 328 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 13.09795 |
| Min length | 8 |
Characters and Unicode
| Total characters | 7168128 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Detroit, MI |
|---|---|
| 2nd row | Cleveland, OH |
| 3rd row | Richmond, VA |
| 4th row | New York, NY |
| 5th row | Milwaukee, WI |
| Value | Count | Frequency (%) |
| tx | 58057 | 4.5% |
| ca | 57564 | 4.5% |
| fl | 56083 | 4.4% |
| ny | 28494 | 2.2% |
| ga | 28148 | 2.2% |
| san | 27783 | 2.2% |
| co | 27358 | 2.1% |
| il | 27325 | 2.1% |
| atlanta | 26294 | 2.1% |
| new | 26245 | 2.0% |
| Other values (398) | 917203 |
Most occurring characters
| Value | Count | Frequency (%) |
| 733283 | 10.2% | |
| a | 551217 | 7.7% |
| , | 547271 | 7.6% |
| o | 393126 | 5.5% |
| e | 378730 | 5.3% |
| n | 349741 | 4.9% |
| t | 341490 | 4.8% |
| l | 317290 | 4.4% |
| i | 271167 | 3.8% |
| r | 261421 | 3.6% |
| Other values (46) | 3023392 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7168128 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 733283 | 10.2% | |
| a | 551217 | 7.7% |
| , | 547271 | 7.6% |
| o | 393126 | 5.5% |
| e | 378730 | 5.3% |
| n | 349741 | 4.9% |
| t | 341490 | 4.8% |
| l | 317290 | 4.4% |
| i | 271167 | 3.8% |
| r | 261421 | 3.6% |
| Other values (46) | 3023392 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7168128 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 733283 | 10.2% | |
| a | 551217 | 7.7% |
| , | 547271 | 7.6% |
| o | 393126 | 5.5% |
| e | 378730 | 5.3% |
| n | 349741 | 4.9% |
| t | 341490 | 4.8% |
| l | 317290 | 4.4% |
| i | 271167 | 3.8% |
| r | 261421 | 3.6% |
| Other values (46) | 3023392 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7168128 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 733283 | 10.2% | |
| a | 551217 | 7.7% |
| , | 547271 | 7.6% |
| o | 393126 | 5.5% |
| e | 378730 | 5.3% |
| n | 349741 | 4.9% |
| t | 341490 | 4.8% |
| l | 317290 | 4.4% |
| i | 271167 | 3.8% |
| r | 261421 | 3.6% |
| Other values (46) | 3023392 |
DEP_TIME
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 1424 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 19784 |
| Missing (%) | 3.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1328.4876 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 600 |
| Q1 | 914 |
| median | 1325 |
| Q3 | 1738 |
| 95-th percentile | 2127 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 824 |
Descriptive statistics
| Standard deviation | 498.97377 |
|---|---|
| Coefficient of variation (CV) | 0.37559535 |
| Kurtosis | -0.98257414 |
| Mean | 1328.4876 |
| Median Absolute Deviation (MAD) | 412 |
| Skewness | 0.028119391 |
| Sum | 7.0075994 × 108 |
| Variance | 248974.82 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 555 | 1378 | 0.3% |
| 556 | 1160 | 0.2% |
| 557 | 1116 | 0.2% |
| 558 | 1108 | 0.2% |
| 554 | 1099 | 0.2% |
| 559 | 1060 | 0.2% |
| 655 | 1046 | 0.2% |
| 600 | 1012 | 0.2% |
| 658 | 988 | 0.2% |
| 553 | 984 | 0.2% |
| Other values (1414) | 516536 | |
| (Missing) | 19784 | 3.6% |
| Value | Count | Frequency (%) |
| 1 | 73 | |
| 2 | 54 | |
| 3 | 54 | |
| 4 | 31 | |
| 5 | 44 | |
| 6 | 40 | |
| 7 | 39 | |
| 8 | 34 | |
| 9 | 28 | < 0.1% |
| 10 | 44 |
| Value | Count | Frequency (%) |
| 2400 | 40 | < 0.1% |
| 2359 | 91 | |
| 2358 | 82 | |
| 2357 | 83 | |
| 2356 | 79 | |
| 2355 | 101 | |
| 2354 | 114 | |
| 2353 | 103 | |
| 2352 | 103 | |
| 2351 | 101 |
DEP_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1241 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 19858 |
| Missing (%) | 3.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.70068 |
| Minimum | -56 |
|---|---|
| Maximum | 3125 |
| Zeros | 23234 |
| Zeros (%) | 4.2% |
| Negative | 292317 |
| Negative (%) | 53.4% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | -56 |
|---|---|
| 5-th percentile | -10 |
| Q1 | -5 |
| median | -2 |
| Q3 | 12 |
| 95-th percentile | 96 |
| Maximum | 3125 |
| Range | 3181 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 64.175619 |
|---|---|
| Coefficient of variation (CV) | 4.0874419 |
| Kurtosis | 207.03893 |
| Mean | 15.70068 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 10.777086 |
| Sum | 8280743 |
| Variance | 4118.51 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -5 | 36474 | 6.7% |
| -4 | 34213 | 6.3% |
| -3 | 32723 | 6.0% |
| -2 | 30128 | 5.5% |
| -6 | 29530 | 5.4% |
| -1 | 27006 | 4.9% |
| -7 | 25582 | 4.7% |
| 0 | 23234 | 4.2% |
| -8 | 20731 | 3.8% |
| -10 | 16343 | 3.0% |
| Other values (1231) | 251449 | |
| (Missing) | 19858 | 3.6% |
| Value | Count | Frequency (%) |
| -56 | 1 | < 0.1% |
| -47 | 1 | < 0.1% |
| -46 | 1 | < 0.1% |
| -44 | 1 | < 0.1% |
| -43 | 1 | < 0.1% |
| -38 | 1 | < 0.1% |
| -37 | 3 | |
| -36 | 4 | |
| -35 | 2 | < 0.1% |
| -34 | 5 |
| Value | Count | Frequency (%) |
| 3125 | 1 | |
| 2972 | 1 | |
| 2923 | 1 | |
| 2892 | 1 | |
| 2809 | 1 | |
| 2800 | 1 | |
| 2629 | 1 | |
| 2318 | 1 | |
| 2311 | 1 | |
| 2151 | 1 |
TAXI_OUT
Real number (ℝ)
MISSING 
| Distinct | 176 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20257 |
| Missing (%) | 3.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.809671 |
| Minimum | 1 |
|---|---|
| Maximum | 213 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 12 |
| median | 16 |
| Q3 | 21 |
| 95-th percentile | 39 |
| Maximum | 213 |
| Range | 212 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 11.389781 |
|---|---|
| Coefficient of variation (CV) | 0.60552794 |
| Kurtosis | 20.951346 |
| Mean | 18.809671 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 3.4792089 |
| Sum | 9912960 |
| Variance | 129.72712 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 41318 | 7.5% |
| 12 | 40117 | 7.3% |
| 14 | 38954 | 7.1% |
| 15 | 36324 | 6.6% |
| 11 | 36298 | 6.6% |
| 16 | 31845 | 5.8% |
| 10 | 29060 | 5.3% |
| 17 | 27978 | 5.1% |
| 18 | 24323 | 4.4% |
| 19 | 21038 | 3.8% |
| Other values (166) | 199759 | |
| (Missing) | 20257 | 3.7% |
| Value | Count | Frequency (%) |
| 1 | 11 | < 0.1% |
| 2 | 7 | < 0.1% |
| 3 | 49 | < 0.1% |
| 4 | 142 | < 0.1% |
| 5 | 496 | 0.1% |
| 6 | 1722 | 0.3% |
| 7 | 4853 | 0.9% |
| 8 | 10576 | 1.9% |
| 9 | 18945 | |
| 10 | 29060 |
| Value | Count | Frequency (%) |
| 213 | 1 | < 0.1% |
| 210 | 1 | < 0.1% |
| 188 | 1 | < 0.1% |
| 184 | 2 | |
| 180 | 1 | < 0.1% |
| 176 | 1 | < 0.1% |
| 171 | 2 | |
| 170 | 2 | |
| 169 | 4 | |
| 167 | 1 | < 0.1% |
ARR_TIME
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 20633 |
| Missing (%) | 3.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1478.9182 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 705 |
| Q1 | 1102 |
| median | 1513 |
| Q3 | 1920 |
| 95-th percentile | 2251 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 818 |
Descriptive statistics
| Standard deviation | 529.06793 |
|---|---|
| Coefficient of variation (CV) | 0.35773982 |
| Kurtosis | -0.32977478 |
| Mean | 1478.9182 |
| Median Absolute Deviation (MAD) | 409 |
| Skewness | -0.37510433 |
| Sum | 7.7885452 × 108 |
| Variance | 279912.87 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1638 | 622 | 0.1% |
| 1656 | 616 | 0.1% |
| 1645 | 613 | 0.1% |
| 1643 | 613 | 0.1% |
| 1633 | 611 | 0.1% |
| 1728 | 607 | 0.1% |
| 1634 | 606 | 0.1% |
| 1738 | 602 | 0.1% |
| 1740 | 597 | 0.1% |
| 1446 | 597 | 0.1% |
| Other values (1430) | 520554 | |
| (Missing) | 20633 | 3.8% |
| Value | Count | Frequency (%) |
| 1 | 278 | |
| 2 | 242 | |
| 3 | 285 | |
| 4 | 229 | |
| 5 | 234 | |
| 6 | 241 | |
| 7 | 241 | |
| 8 | 252 | |
| 9 | 220 | |
| 10 | 219 |
| Value | Count | Frequency (%) |
| 2400 | 239 | |
| 2359 | 283 | |
| 2358 | 295 | |
| 2357 | 289 | |
| 2356 | 291 | |
| 2355 | 309 | |
| 2354 | 317 | |
| 2353 | 335 | |
| 2352 | 338 | |
| 2351 | 386 |
ARR_DELAY
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1282 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 21901 |
| Missing (%) | 4.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.352298 |
| Minimum | -90 |
|---|---|
| Maximum | 3136 |
| Zeros | 8971 |
| Zeros (%) | 1.6% |
| Negative | 306999 |
| Negative (%) | 56.1% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | -90 |
|---|---|
| 5-th percentile | -29 |
| Q1 | -16 |
| median | -5 |
| Q3 | 13 |
| 95-th percentile | 97 |
| Maximum | 3136 |
| Range | 3226 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 66.784961 |
|---|---|
| Coefficient of variation (CV) | 6.4512206 |
| Kurtosis | 180.88394 |
| Mean | 10.352298 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 9.786023 |
| Sum | 5438787 |
| Variance | 4460.231 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -11 | 12841 | 2.3% |
| -12 | 12822 | 2.3% |
| -13 | 12661 | 2.3% |
| -10 | 12583 | 2.3% |
| -9 | 12446 | 2.3% |
| -8 | 12421 | 2.3% |
| -14 | 12280 | 2.2% |
| -7 | 11905 | 2.2% |
| -15 | 11851 | 2.2% |
| -16 | 11572 | 2.1% |
| Other values (1272) | 401988 | |
| (Missing) | 21901 | 4.0% |
| Value | Count | Frequency (%) |
| -90 | 1 | < 0.1% |
| -87 | 1 | < 0.1% |
| -86 | 1 | < 0.1% |
| -84 | 2 | < 0.1% |
| -82 | 1 | < 0.1% |
| -81 | 2 | < 0.1% |
| -80 | 1 | < 0.1% |
| -77 | 4 | |
| -76 | 4 | |
| -75 | 5 |
| Value | Count | Frequency (%) |
| 3136 | 1 | |
| 2989 | 1 | |
| 2901 | 1 | |
| 2884 | 1 | |
| 2833 | 1 | |
| 2779 | 1 | |
| 2614 | 1 | |
| 2334 | 1 | |
| 2300 | 1 | |
| 2185 | 1 |
CANCELLED
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| 0.0 | |
|---|---|
| 1.0 | 20389 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1641813 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 526882 | |
| 1.0 | 20389 | 3.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 526882 | |
| 1.0 | 20389 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1074153 | |
| . | 547271 | |
| 1 | 20389 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1074153 | |
| . | 547271 | |
| 1 | 20389 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1074153 | |
| . | 547271 | |
| 1 | 20389 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1074153 | |
| . | 547271 | |
| 1 | 20389 | 1.2% |
CANCELLATION_CODE
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 526882 |
| Missing (%) | 96.3% |
| Memory size | 4.2 MiB |
| B | |
|---|---|
| A | |
| C | 568 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20389 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | B |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| B | 12085 | 2.2% |
| A | 7736 | 1.4% |
| C | 568 | 0.1% |
| (Missing) | 526882 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| b | 12085 | |
| a | 7736 | |
| c | 568 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 12085 | |
| A | 7736 | |
| C | 568 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 20389 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 12085 | |
| A | 7736 | |
| C | 568 | 2.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 20389 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 12085 | |
| A | 7736 | |
| C | 568 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 20389 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 12085 | |
| A | 7736 | |
| C | 568 | 2.8% |
DIVERTED
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| 0.0 | |
|---|---|
| 1.0 | 1512 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1641813 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 545759 | |
| 1.0 | 1512 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 545759 | |
| 1.0 | 1512 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1093030 | |
| . | 547271 | |
| 1 | 1512 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1093030 | |
| . | 547271 | |
| 1 | 1512 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1093030 | |
| . | 547271 | |
| 1 | 1512 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1641813 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1093030 | |
| . | 547271 | |
| 1 | 1512 | 0.1% |
| ARR_DELAY | ARR_TIME | CANCELLATION_CODE | CANCELLED | DEP_DELAY | DEP_TIME | DEST_AIRPORT_ID | DEST_AIRPORT_SEQ_ID | DEST_CITY_MARKET_ID | DIVERTED | OP_CARRIER_FL_NUM | OP_UNIQUE_CARRIER | ORIGIN_AIRPORT_ID | ORIGIN_AIRPORT_SEQ_ID | ORIGIN_CITY_MARKET_ID | TAXI_OUT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ARR_DELAY | 1.000 | 0.117 | 0.000 | 1.000 | 0.708 | 0.154 | 0.003 | 0.003 | 0.004 | 1.000 | -0.013 | 0.020 | -0.019 | -0.019 | -0.042 | 0.287 |
| ARR_TIME | 0.117 | 1.000 | 0.000 | 1.000 | 0.150 | 0.760 | 0.021 | 0.021 | 0.045 | 0.033 | 0.033 | 0.056 | -0.014 | -0.014 | -0.049 | -0.028 |
| CANCELLATION_CODE | 0.000 | 0.000 | 1.000 | 1.000 | 0.055 | 0.140 | 0.251 | 0.251 | 0.204 | 1.000 | 0.393 | 0.656 | 0.253 | 0.253 | 0.209 | 0.074 |
| CANCELLED | 1.000 | 1.000 | 1.000 | 1.000 | 0.023 | 0.021 | 0.034 | 0.034 | 0.041 | 0.010 | 0.040 | 0.174 | 0.031 | 0.031 | 0.041 | 0.009 |
| DEP_DELAY | 0.708 | 0.150 | 0.055 | 0.023 | 1.000 | 0.212 | 0.011 | 0.011 | 0.011 | 0.013 | -0.065 | 0.019 | -0.040 | -0.040 | -0.074 | 0.040 |
| DEP_TIME | 0.154 | 0.760 | 0.140 | 0.021 | 0.212 | 1.000 | 0.027 | 0.027 | 0.064 | 0.006 | 0.046 | 0.056 | -0.038 | -0.038 | -0.062 | -0.055 |
| DEST_AIRPORT_ID | 0.003 | 0.021 | 0.251 | 0.034 | 0.011 | 0.027 | 1.000 | 1.000 | 0.623 | 0.018 | -0.027 | 0.173 | -0.004 | -0.004 | -0.024 | 0.027 |
| DEST_AIRPORT_SEQ_ID | 0.003 | 0.021 | 0.251 | 0.034 | 0.011 | 0.027 | 1.000 | 1.000 | 0.623 | 0.018 | -0.027 | 0.173 | -0.004 | -0.004 | -0.024 | 0.027 |
| DEST_CITY_MARKET_ID | 0.004 | 0.045 | 0.204 | 0.041 | 0.011 | 0.064 | 0.623 | 0.623 | 1.000 | 0.012 | -0.015 | 0.169 | -0.024 | -0.024 | -0.065 | 0.029 |
| DIVERTED | 1.000 | 0.033 | 1.000 | 0.010 | 0.013 | 0.006 | 0.018 | 0.018 | 0.012 | 1.000 | 0.007 | 0.026 | 0.006 | 0.006 | 0.010 | 0.017 |
| OP_CARRIER_FL_NUM | -0.013 | 0.033 | 0.393 | 0.040 | -0.065 | 0.046 | -0.027 | -0.027 | -0.015 | 0.007 | 1.000 | 0.395 | -0.011 | -0.011 | 0.002 | 0.113 |
| OP_UNIQUE_CARRIER | 0.020 | 0.056 | 0.656 | 0.174 | 0.019 | 0.056 | 0.173 | 0.173 | 0.169 | 0.026 | 0.395 | 1.000 | 0.173 | 0.173 | 0.169 | 0.067 |
| ORIGIN_AIRPORT_ID | -0.019 | -0.014 | 0.253 | 0.031 | -0.040 | -0.038 | -0.004 | -0.004 | -0.024 | 0.006 | -0.011 | 0.173 | 1.000 | 1.000 | 0.623 | -0.025 |
| ORIGIN_AIRPORT_SEQ_ID | -0.019 | -0.014 | 0.253 | 0.031 | -0.040 | -0.038 | -0.004 | -0.004 | -0.024 | 0.006 | -0.011 | 0.173 | 1.000 | 1.000 | 0.623 | -0.025 |
| ORIGIN_CITY_MARKET_ID | -0.042 | -0.049 | 0.209 | 0.041 | -0.074 | -0.062 | -0.024 | -0.024 | -0.065 | 0.010 | 0.002 | 0.169 | 0.623 | 0.623 | 1.000 | -0.039 |
| TAXI_OUT | 0.287 | -0.028 | 0.074 | 0.009 | 0.040 | -0.055 | 0.027 | 0.027 | 0.029 | 0.017 | 0.113 | 0.067 | -0.025 | -0.025 | -0.039 | 1.000 |
| FL_DATE | OP_UNIQUE_CARRIER | OP_CARRIER_FL_NUM | ORIGIN_AIRPORT_ID | ORIGIN_AIRPORT_SEQ_ID | ORIGIN_CITY_MARKET_ID | ORIGIN | ORIGIN_CITY_NAME | DEST_AIRPORT_ID | DEST_AIRPORT_SEQ_ID | DEST_CITY_MARKET_ID | DEST | DEST_CITY_NAME | DEP_TIME | DEP_DELAY | TAXI_OUT | ARR_TIME | ARR_DELAY | CANCELLED | CANCELLATION_CODE | DIVERTED | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1/1/2024 12:00:00 AM | 9E | 4814 | 12478 | 1247805 | 31703 | JFK | New York, NY | 11433 | 1143302 | 31295 | DTW | Detroit, MI | 1247.0 | -5.0 | 31.0 | 1449.0 | -19.0 | 0.0 | NaN | 0.0 |
| 1 | 1/1/2024 12:00:00 AM | 9E | 4815 | 13487 | 1348702 | 31650 | MSP | Minneapolis, MN | 11042 | 1104205 | 30647 | CLE | Cleveland, OH | 1001.0 | -14.0 | 20.0 | 1255.0 | -30.0 | 0.0 | NaN | 0.0 |
| 2 | 1/1/2024 12:00:00 AM | 9E | 4817 | 12478 | 1247805 | 31703 | JFK | New York, NY | 14524 | 1452401 | 34524 | RIC | Richmond, VA | 1411.0 | -4.0 | 21.0 | 1541.0 | -20.0 | 0.0 | NaN | 0.0 |
| 3 | 1/1/2024 12:00:00 AM | 9E | 4817 | 14524 | 1452401 | 34524 | RIC | Richmond, VA | 12478 | 1247805 | 31703 | JFK | New York, NY | 1643.0 | -7.0 | 13.0 | 1759.0 | -42.0 | 0.0 | NaN | 0.0 |
| 4 | 1/1/2024 12:00:00 AM | 9E | 4818 | 11433 | 1143302 | 31295 | DTW | Detroit, MI | 13342 | 1334207 | 33342 | MKE | Milwaukee, WI | 1010.0 | -5.0 | 21.0 | 1020.0 | -14.0 | 0.0 | NaN | 0.0 |
| 5 | 1/1/2024 12:00:00 AM | 9E | 4822 | 12451 | 1245102 | 31136 | JAX | Jacksonville, FL | 12953 | 1295304 | 31703 | LGA | New York, NY | 1403.0 | -7.0 | 14.0 | 1603.0 | -24.0 | 0.0 | NaN | 0.0 |
| 6 | 1/1/2024 12:00:00 AM | 9E | 4822 | 12953 | 1295304 | 31703 | LGA | New York, NY | 12451 | 1245102 | 31136 | JAX | Jacksonville, FL | 947.0 | -8.0 | 26.0 | 1231.0 | -13.0 | 0.0 | NaN | 0.0 |
| 7 | 1/1/2024 12:00:00 AM | 9E | 4823 | 10994 | 1099402 | 30994 | CHS | Charleston, SC | 12953 | 1295304 | 31703 | LGA | New York, NY | 1135.0 | -5.0 | 8.0 | 1314.0 | -24.0 | 0.0 | NaN | 0.0 |
| 8 | 1/1/2024 12:00:00 AM | 9E | 4823 | 12953 | 1295304 | 31703 | LGA | New York, NY | 10994 | 1099402 | 30994 | CHS | Charleston, SC | 810.0 | -5.0 | 14.0 | 1013.0 | -31.0 | 0.0 | NaN | 0.0 |
| 9 | 1/1/2024 12:00:00 AM | 9E | 4828 | 12397 | 1239703 | 32397 | ITH | Ithaca/Cortland, NY | 12478 | 1247805 | 31703 | JFK | New York, NY | 1248.0 | -12.0 | 12.0 | 1355.0 | -24.0 | 0.0 | NaN | 0.0 |
| FL_DATE | OP_UNIQUE_CARRIER | OP_CARRIER_FL_NUM | ORIGIN_AIRPORT_ID | ORIGIN_AIRPORT_SEQ_ID | ORIGIN_CITY_MARKET_ID | ORIGIN | ORIGIN_CITY_NAME | DEST_AIRPORT_ID | DEST_AIRPORT_SEQ_ID | DEST_CITY_MARKET_ID | DEST | DEST_CITY_NAME | DEP_TIME | DEP_DELAY | TAXI_OUT | ARR_TIME | ARR_DELAY | CANCELLED | CANCELLATION_CODE | DIVERTED | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 547261 | 1/31/2024 12:00:00 AM | YX | 5837 | 12953 | 1295304 | 31703 | LGA | New York, NY | 13871 | 1387102 | 33316 | OMA | Omaha, NE | 1846.0 | -9.0 | 15.0 | 2051.0 | -41.0 | 0.0 | NaN | 0.0 |
| 547262 | 1/31/2024 12:00:00 AM | YX | 5838 | 11433 | 1143302 | 31295 | DTW | Detroit, MI | 14122 | 1412202 | 30198 | PIT | Pittsburgh, PA | 1615.0 | -4.0 | 28.0 | 1728.0 | -3.0 | 0.0 | NaN | 0.0 |
| 547263 | 1/31/2024 12:00:00 AM | YX | 5838 | 14122 | 1412202 | 30198 | PIT | Pittsburgh, PA | 11433 | 1143302 | 31295 | DTW | Detroit, MI | 1825.0 | -5.0 | 19.0 | 1936.0 | -15.0 | 0.0 | NaN | 0.0 |
| 547264 | 1/31/2024 12:00:00 AM | YX | 5840 | 11433 | 1143302 | 31295 | DTW | Detroit, MI | 11066 | 1106606 | 31066 | CMH | Columbus, OH | 1606.0 | -4.0 | 17.0 | 1701.0 | -11.0 | 0.0 | NaN | 0.0 |
| 547265 | 1/31/2024 12:00:00 AM | YX | 5842 | 10721 | 1072102 | 30721 | BOS | Boston, MA | 12478 | 1247805 | 31703 | JFK | New York, NY | 1552.0 | -8.0 | 17.0 | 1700.0 | -25.0 | 0.0 | NaN | 0.0 |
| 547266 | 1/31/2024 12:00:00 AM | YX | 5843 | 12953 | 1295304 | 31703 | LGA | New York, NY | 14492 | 1449202 | 34492 | RDU | Raleigh/Durham, NC | 1201.0 | 51.0 | 29.0 | 1347.0 | 38.0 | 0.0 | NaN | 0.0 |
| 547267 | 1/31/2024 12:00:00 AM | YX | 5844 | 12953 | 1295304 | 31703 | LGA | New York, NY | 11278 | 1127805 | 30852 | DCA | Washington, DC | 2016.0 | -14.0 | 16.0 | 2128.0 | -32.0 | 0.0 | NaN | 0.0 |
| 547268 | 1/31/2024 12:00:00 AM | YX | 5845 | 10821 | 1082106 | 30852 | BWI | Baltimore, MD | 12478 | 1247805 | 31703 | JFK | New York, NY | 1719.0 | 3.0 | 11.0 | 1827.0 | -18.0 | 0.0 | NaN | 0.0 |
| 547269 | 1/31/2024 12:00:00 AM | YX | 5845 | 12478 | 1247805 | 31703 | JFK | New York, NY | 10821 | 1082106 | 30852 | BWI | Baltimore, MD | 1552.0 | 31.0 | 15.0 | 1653.0 | 19.0 | 0.0 | NaN | 0.0 |
| 547270 | 1/31/2024 12:00:00 AM | YX | 5846 | 15096 | 1509602 | 35096 | SYR | Syracuse, NY | 12953 | 1295304 | 31703 | LGA | New York, NY | 559.0 | -1.0 | 14.0 | 708.0 | -23.0 | 0.0 | NaN | 0.0 |